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Abstract 



Capsids of many viruses assemble around nucleic acids or other polymers. Un- 
derstanding how the properties of the packaged polymer affect the assembly pro- 
cess could promote biomedical efforts to prevent viral assembly or nanomaterials 
applications that exploit assembly. To this end, we simulate on a lattice the dy- 
namical assembly of closed, hollow shells composed of several hundred to 1000 
subunits, around a flexible polymer. We find that assembly is most efficient at an 
optimum polymer length that scales with the surface area of the capsid; signif- 
icantly longer than optimal polymers often lead to partial-capsids with unpack- 
aged polymer 'tails' or a competition between multiple partial-capsids attached 
to a single polymer. These predictions can be tested with bulk experiments in 
which capsid proteins assemble around homopolymeric RNA or synthetic poly- 
electrolytes. We also find that the polymer can increase the net rate of subunit 
accretion to a growing capsid both by stabilizing the addition of new subunits and 
by enhancing the incoming flux of subunits; the effects of these processes may be 
distinguishable with experiments that monitor the assembly of individual capsids. 
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The self-assembly of ordered structures is crucial in biology and is now pro- 
viding a route to develop novel nano structured materials. The success of assem- 
bly is governed by a competition between thermodynamics and kinetics, since 
metastable disordered states (kinetic traps) can impede the formation of an or- 
dered thermodynamic ground state dH-Q). Viral proteins are a paradigm for suc- 
cessful assembly - during the replication of a typical virus, hundreds of proteins 
selectively form a protein shell, or capsid, that encases the viral nucleic acid. 
Single- stranded RNA virus capsids assemble around their RNA, and require RNA 
(or other poly anions ([8-12)) to assemble at physiological conditions. How the 
packaged polymer promotes assembly is poorly understood because assembly in- 
termediates are transient and thus challenging to characterize with experiments. 
Therefore, this paper examines a highly simplified model for capsid assembly 
around a flexible polymer, which yields experimentally testable predictions for the 
relationships of polymer length and solution conditions to assembly kinetics and 
assembly yields. Understanding how the polymer effects encapsulation dynamics 
could spur development of antiviral drugs that block assembly and provide criti- 
cal knowledge to exploit capsids for use as drug delivery vehicles or gene therapy 
vectors. 

In this first simulation study of the dynamics of polymer encapsidation we aim 
for the simplest possible description of capsid assembly around a polymer. Be- 
cause successful assembly must avoid kinetic traps, we require a model with no 
a priori assumptions about assembly pathways or the structures that emerge from 
assembly. The essential ingredients are: (1) the protein and polymer units are 
space filling, (2) the lowest energy state for the capsid is a hollow shell, and (3) 
there are short-ranged attractive interactions (representing screened electrostatics) 
between the polymer and protein units that favor encapsulation of the polymer. 
We arrive at a lattice model for protein and polymer units with pairwise attrac- 
tions (Fig. [T]and section Model), with assembly simulated with dynamic Monte 
Carlo. Because our questions are not specific to a particular capsid symmetry, we 
consider a cubic lattice and 'capsids' for which a cubic shell is the ground state. 
The model is general, however, and could be implemented on a quasicrystalline 
lattice that allows icosahedral symmetry. 

Elegant experiments have studied capsid assembly around ssRNA (e.g. 



2m)), but it is difficult to relate individual nucleic acid properties to assembly 



behavior because nucleic acid molecules with different sequences can have dra- 
matically different secondary and tertiary structures (12 ll) and the fact that cap- 
sids assemble around synthetic polyelectrolytes |8- 10) and nanoparticles (11, 12) 
demonstrates that properties specific to nucleic acids are not required for cap- 
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sid formation or cargo packaging. Therefore in this work we primarily focus 
on experimental model systems in which capsid proteins assemble around syn- 
thetic polyelectrolytes (|8Ml(J) or homopolymeric RNA. To begin to understand 
the effects of RNA-RNA base-pairing, we consider an extension to our model in 
which there are short-ranged attractive interactions between polymer segments. 
Although these attractions specifically represent a linear polymer in a poor sol- 
vent, they could shed light on assembly around RNA molecules, which form com- 
pact structures in solution due to base-pairing (|2ll) . 

Under optimal conditions for assembly, we find that capsid growth and poly- 
mer incorporation proceed in concert, with the polymer forming a dense adsorbed 
layer on the partial capsid intermediate that stabilizes the addition of new cap- 
sid subunits. Whereas assembly is highly efficient under these conditions, longer 
than optimal polymers or strong interactions lead to several characteristic forms 
of kinetic traps. These should be identifiable in capsid assembly experiments 
through electron microscopy. The simulations demonstrate that assembly around 
a polymer can proceed by mechanisms unlike those identified for empty capsid 
formation, such as the diffusion of unassembled protein subunits along the poly- 
mer Q. 

Due to its simplicity and the lattice-implementation, our model provides sig- 
nificant computational speed up as compared to previously developed models, and 
allows simulation of time scales that correspond to minutes in real time and cap- 
sids with up to thousands of subunits. Thus, we are able to build upon prior mod- 
eling studies of capsid-polymer assembly that are equilibrium calculations and/or 
postulate particular assembly pathways and structures (|22M34|) (for a nice review 
see (|35lV ). Because our predictions for assembly yields and assembly rates are 
experimentally testable, we hope to motivate experiments that build upon prior 
experimental work investigating structures (e.g. (|13Ml9l \36b ) and kinetics (|20|) of 
viral proteins assembling around nucleic acids. 



Mechanisms of polymer encapsidation 

To understand the influence of polymer properties on capsid assembly, we per- 
formed simulations for a range of binding energies e b , capsid sizes N c , polymer- 
subunit interaction energies e pc and polymer lengths N p . The parameters e b and 
Eric could be experimentally controlled by varying solution pH or ionic strength 
(0,113). We will discuss simulations with e b = 5.85/cb^ 1 and capsid free subunit 
volume fraction 0.5%; spontaneous assembly of empty capsids at this subunit con- 
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centration requires > 6.5k B T. Except for Fig. |5fc, all simulations consider no 
polymer-polymer attractions (e pp = 0.0/cb^)- For most parameter sets that lead 
to successful polymer encapsidation, assembly first requires nucleation of a small 
partial-capsid associated to the polymer, which is followed by a growth phase in 
which subunits reversibly bind to the partial-capsid (Fig. |2]). Finally, there can 
be a completion phase, during which addition of the final few subunits is delayed 
until the polymer is entirely incorporated within the capsid. 

Throughout the growth phase, the polymer adsorbs onto the capsid interme- 
diate in a dense layer with relatively short loops and one or two long tails. The 
nature of the adsorbed layer is independent of polymer length and partial-capsid 
size until the entire polymer is adsorbed, in the sense that the number of polymer- 
capsid interactions (per subunit in the partial-capsid) is independent of polymer 
length until the entire polymer is adsorbed, as shown in Fig. |3]for e pc = 5.75k B T 
(see also Fig. Sll in the Supporting Material). This independence is observed for 
all e pc for which assembly occurred, although the ratio of polymer-capsid contacts 
to partial-capsid size decreases with decreasing e pc . The formation of a dense 
layer with a small number of long loops or tails for a polymer adsorbing onto a 
small surface was predicted theoretically (|38h . 

Trapped configurations limit the length of polymer that can be sponta- 
neously encapsidated. The fact that the amount of polymer incorporated within 
a partial-capsid is independent of polymer length constrains the length of polymer 
that can be efficiently packaged during assembly. In particular, assembly around 
long polymers frequently results in configurations such as shown in Fig. @k, in 
which capsid closure was faster than polymer incorporation, trapping a polymer 
tail outside of the capsid. Complete polymer incorporation and assembly of the 
final few capsid subunits requires the polymer tail to retrace its contour into the 
capsid. However, few additional polymer-capsid interactions result during poly- 
mer retraction and additional capsid subunit- subunit interactions are only possible 
after the entire tail is inside the capsid. Furthermore, as shown by de Gennes (1391) . 
the time for a polymer tail to retrace its contour is exponential in the length of the 
tail. Hence, there can be a long completion phase in which assembly is stalled 
until the polymer tail completely retracts; polymer incorporation and capsid com- 
pletion are rarely observed in our simulations after a capsid entraps a long polymer 
tail. Complete incorporation becomes even thermodynamically unfavorable above 
a certain polymer length, as first suggested by the equilibrium arguments of van 
der Schoot and Bruinsma (|28|) . 

An example of a second class of configurations that impede complete poly- 
mer incorporation is shown in Fig. @1>. This 'dumbbell' configuration is the usual 
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outcome if two capsids nucleate on the same polymer and grow to significant 
size before coming into proximity; geometries of large partial-capsids are rarely 
compatible enough for a successful fusion event without significant subunit disso- 
ciation. Hence, dumbbell configurations are common for parameter sets for which 
capsid nucleation rates are significantly larger than capsid growth rates (see be- 
low). Completion of a capsid from this configuration is unlikely, since it would 
require complete retraction of the polymer from one of the capsids, which has 
a high free energy barrier (and is not thermodynamically favorable for polymers 
beyond a certain length). 

Configurations with pinched polymer tails (Fig. Hk) usually lead to dumbbell 
configurations if the time to nucleate a new capsid, which is inversely propor- 
tional to the length of the tail (see below) is shorter than that tail retraction time 
(exponential in tail length). We note that the dumbbell capsid configuration re- 
sembles malformed capsids that have been observed in experiments (e.g. Fig. 
l4b) (|36|) . Thus, as discussed below, our prediction that configurations similar to 
those shown in Fig. |4] will increase in frequency as the polymer length increases 
can be tested with imaging experiments. 

Polymer encapsidation efficiency is nonmonotonic with respect to poly- 
mer length and polymer-subunit interaction strength. An experimentally ac- 
cessible measure of assembly effectiveness is the packaging efficiency, or the frac- 
tion of trajectories in which a polymer is incorporated inside a complete capsid. 
Simulated packaging efficiencies are shown as a function of polymer length for 
varying capsid sizes (Fig. [5^) and time (Fig. S9), with a complete capsid defined 
as a hollow shell with no gaps. For all times and capsid sizes, there is an optimal 
polymer length for which efficiency is maximal. Polymer lengths are normalized 
by the inner capsid surface area in Fig. [5k to show that the optimal polymer length 
is proportional to the number of polymer-subunit contacts in a complete capsid, 
which is proportional to the capsid size N. Note that the polymer radius of gy- 
ration, (see Fig. S8), is as much as thirty times the radius of the capsid for the 
longest polymers, consistent with the experimental observation that polystyrene 
sulfonate molecules with radii of gyration much larger than capsid size can be 
incorporated in cowpea chlorotic mottle virus capsids ABU- 

The fact that encapsidation efficiency is proportional to the driving force of 
polymer-subunit attractive interactions rather than capsid volume is consistent 
with theoretical equilibrium studies for polyelectrolyte encapsidation (l26l |28L |4(J). 
noting however that polymer segment- segment electrostatic repulsions are ac- 
counted for only by segment excluded volume in our model. This observation 
can be understood by noting that the interactions in our model (and electrostatic 
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interactions at physiological conditions) are short ranged; hence polymer-capsid 
interactions are confined to the layer at the inner capsid surface. However, both 
thermodynamics and kinetics play an important role in limiting packaging effi- 
ciencies in our simulations. Below the optimal length, increasing polymer length 
provides a stronger thermodynamic driving force for assembly and enables faster 
nucleation and growth rates, as discussed below. Larger than optimal polymers 
also drive rapid capsid growth, but they tend to engender traps (discussed above) 
that block assembly. Similarly, the thermodynamically optimal length increases 
with polymer- subunit interaction strength e pc , but the optimal polymer length 
measured in our dynamical simulations decreases with increasing e pc (Fig. [5t>) 
because stronger interactions increase the propensity for kinetic traps. The inter- 
action strength e pc could be experimentally controlled by varying ionic strength. 

Polymer-polymer attractions enable packaging of longer polymers. Pack- 
aging efficiencies are shown as a function of polymer length for a polymer with 
segment- segment attractions (e pp = 0.075A; B T) in Fig. Efc, where we see that poly- 
mers well above the length threshold for the case without attractions are packaged 
with nearly 100% efficiency. This dramatic improvement in efficiency occurs be- 
cause interaction with the assembling capsid causes the polymer to collapse into 
compact configurations (Fig. [6]), reducing the likelihood of incomplete polymer 
incorporation. This effect is even more significant for stronger attractions (past 
the coil-globule transition). We note that polymer attractions increase nucleation 
rates by decreasing the free energy of the critical nucleus (see the next section) and 
thus can increase the frequency of double nucleation. For this reason the results 
in Fig. [5k are shown for a decreased value of e pc = 5.25. Furthermore, because 
the volume of an assembled capsid may vary somewhat, very long polymers are 
sometimes encapsidated because they drive the formation of a capsid larger than 
the optimal size dictated by subunit-subunit interactions. This effect is much more 
prominent here than in simulations without polymer segment-segment attractions. 

Nucleation and growth rates 

To understand the effect of system parameters on overall assembly rates, we mea- 
sured durations of each phase of assembly (nucleation, growth, completion) as 
functions of polymer length and polymer- subunit interaction strength. For each 
trajectory, we define the nucleation time r nuc as the last timepoint for which the 
largest cluster was smaller than the critical nucleus size (n nuc = 8 subunits, see 
Fig. S10). The growth time then corresponds to the interval between nucleation 



Mechanisms of capsid assembly around a polymer 



7 



and containment, where a polymer is 'contained' when within an assemblage of 
subunits that does not permit passage of 2x2x2 or larger cube; this definition dis- 
tinguishes the growth phase from the completion phase described above. 

Nucleation and growth rates increase with polymer length and polymer- 
subunit interaction strength. As shown in Fig. [7} nucleation rates (r~J) increase 
linearly with polymer length and exponentially with polymer- subunit interaction 
strength: r^l ~ A^ p exp[— a(n nuc — l)e pc /A; B T]. These dependencies can be under- 
stood by noting that spontaneous nucleation of empty capsids does not occur for 
these system parameters, so nucleation requires that small partial-capsid interme- 
diates are stabilized through interactions with the polymer. Consistent with mod- 
eling studies of empty capsids (I4ll. l42f) and assembly on nanoparticles (|43l .l44l). the 
system rapidly builds up pre-nucleation intermediates with Boltzmann-weighted 
concentrations c n ~ CqN v exp[— (G n + o>ne vc ) / k B T] with n the intermediate size, 
G n the subunit-subunit interaction free energy of intermediate n, and the num- 
ber of adsorbed intermediates is proportional to the polymer length N p for a fixed 
polymer concentration. The parameter a is the number of polymer- subunit in- 
teractions per capsid subunit. The nucleation rate can then be expressed as (|43r) 
r nuc — c o c n nuc -i (see the SI for further analysis). 

Growth rates also increase with increasing polymer length and polymer- subunit 
interaction strength, but saturate at a limiting value (Fig. |7k). This trend reflects 
two mechanisms by which the polymer can influence capsid growth. First, bind- 
ing to the polymer stabilizes partial-capsid intermediates; this is a thermodynamic 
effect that increases the net rate of assembly by decreasing the rate of subunit 
desorption from adsorbed intermediates. The net stabilization is proportional to 
the number of subunit-polymer contacts in a partial-capsid intermediate, which is 
independent of polymer length until the polymer is completely incorporated (Fig. 
[3]). Therefore, capsid growth times are significantly longer for polymers that are 
completely incorporated before about two thirds of the capsid has assembled, but 
depend only weakly on polymer length for longer polymers. Similarly, the effect 
of increasing e pc saturates when the unbinding rates of polymer-stabilized subunits 
become small compared to association rates. 

Subunit sliding. The second effect of the polymer on growth times is purely 
kinetic; as proposed by Hu and Shklovskii (|22l) the polymer can enhance the flux 
of subunits to binding sites on partial-capsid intermediates, through sliding or cor- 
related diffusion of subunits along the polymer. To characterize the role of sliding 
in our simulations, we performed simulations with extra 'sliding' move attempts, 
in which a subunit interacting with a polymer is moved forward or backward by 
one polymer segment, and a subunit orientation is chosen at random from the set 



Mechanisms of capsid assembly around a polymer 



8 



of orientations that enable a subunit-polymer attraction; the move is accepted or 
rejected according to the Metropolis criteria (full details in the SI). The subunit 
sliding rate (one-dimensional diffusion constant) was varied by changing the fre- 
quency of sliding moves relative to ordinary subunit motions; as shown in Fig.[7fc, 
nucleation and growth rates increase with the relative sliding rate. This observa- 
tion indicates that subunit sliding enhances assembly rates, but shows that sliding 
affects nucleation and growth rates differently; the effect on nucleation will dom- 
inate under typical conditions, for which nucleation is rate limiting. 

We note that another mechanism by which a polymer could enhance the flux 
of subunits to a partial capsid is through correlated polymer- subunit motions (i.e. 
the poly mer reels an attached subunit in like a hooked fish). Previous works 
(I25L I33L 1431) have suggested that in the limit of high polymer- subunit affinities 
subunits adsorb on a polymer en masse and then collectively reorient to form a 
capsid. As discussed in Model our Monte Carlo simulations do not effectively 
explore this mechanism because they use single particle moves (0, Esl). We are 
currently exploring the importance of collective moves using off-lattice Brown- 
ian dynamics simulations of capsid assembly around the polymer [unpublished]. 
We believe that the use of single particle moves does not qualitatively affect the 
results for most parameter sets because the dependencies on system parameters 
and failure modes reported here are similar to those obtained with the Brownian 
dynamics simulations. The results suggest that collective moves are significant 
at high polymer- subunit interactions and/or subunit concentrations; however, the 
conclusion that overly strong interactions or high subunit concentrations lead to 
kinetic traps remains valid. 

Viral genome specificity. The observation that assembly rates will be nucleation- 
limited under conditions of optimal for assembly and the preceding analysis of- 
fers one possible mechanism by which viruses can preferentially package their 
genomic RNA over random RNA (e.g. (I45L l46h ). In many viruses, the capsid 
protein has been shown to specifically and strongly bind to a short 'packaging 
sequence' on the viral genome. Denoting the sequence-specific binding free en- 
ergy as e ss , the ratio of assembly rates around viral and random RNA is given by 
r~c (viral) /t~J, (random) = exp[— n s (e ss — e pc )//c B T]c v /c r , with c v and c r the con- 
centrations of viral and random RNA molecules in the vicinity of the assembling 
capsid proteins and n s the length of the packaging sequence. Nearly complete 
specificity for the viral genome will occur for n s (e ss — e pc ) ^> In c v /c r . We are 
exploring this prediction in simulations that explicitly represent packaging se- 
quences, and note that recent observations that viral genomes have smaller sizes 
in solution than other RNA molecules could also play a role. 
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Implications for experiments 

Measuring polymer incorporation efficiencies in experiments. The simu- 
lations described in this work predict that capsids can assemble around flexi- 
ble polymers with high yields, but that packaging efficiencies are nonmonotonic 
with respect to polymer length. These predictions could be tested by measur- 
ing packaging efficiencies in experiments in which capsid proteins from viruses 
with single- stranded genomes assemble around synthetic polyelectrolytes 
or nucleic acids that do not undergo base-pairing, such as homopolymeric RNA. 
Our results suggest that packaging efficiencies will decrease and incompletely in- 
corporated polymers or dumbbell capsids will appear as the polymer length is 
increased significantly beyond the viral genome length. 

Experimentally measuring packaging efficiencies will require distinguishing 
well-formed capsids from failed assemblages (those that have unincorporated poly- 
meric tails or are otherwise incomplete). Failed assemblages could be identifiable 



by TEM or AFM dill) . In the case of assembly around homopolymeric RNA, 
incompletely incorporated RNA can be identified by treating the capsid solution 
with RNase to digest unincorporated polymer, washing out RNase, disassembling 
capsids, and running the remaining polymer on a gel (|47T). 

The simulation results for polymer-polymer attractions (Fig. |5fc) suggest that 
unincorporated polymer tails will be less prevalent in the case of polymers that 
form compact structures. Although this model is not meant to represent RNA, 



base-pairing of viral RNA molecules leads to compact structures in solution (|21l) . 
and thus these results suggest that unincorporated polymer tails and dumbbell cap- 
sids will be less prevalent for RNA. It would therefore be interesting to compare 
the polymer length dependence of packaging experiments involving homopoly- 
meric RNA with those involving viral RNA. It would be important to consider 
various sequences of viral RNA to distinguish the effects of base-pairing from 
effects of recognition sequences. 

Measuring capsid growth rates in experiments. The simulations demon- 
strate that overall capsid formation time distributions are a convolution of the 
time distributions for each of the three phases: nucleation, growth, and comple- 
tion. Since the completion phase is likely to be difficult to monitor in experiments, 
measured assembly time distributions will include nucleation times and growth 
times. Incorporation efficiencies are highest in our simulations when nucleation 
of multiple capsids on a single polymer is unlikely, which requires that nucleation 
times are longer than or comparable to growth times. It may therefore be difficult 
to extract dependencies of capsid growth times on system parameters from bulk 
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measurements (see Refs. f° r a discussion of this constraint for empty cap- 

sids). With experiments that monitor the assembly of individual capsids (e.g. Ref. 
(48)), however, it is possible to separate nucleation and growth phases as we have 
done for the simulations in this work; comparison of results from these exper- 
iments to simulated growth times could elucidate mechanisms of capsid growth 
after nucleation. Although single capsid assembly experiments have thus far relied 
upon confocal microscopy to visualize assembly of capsids on cell membranes, it 
might be possible to visualize the assembly of non-membrane-associated capsids 
using confocal microscopy or total internal reflectance microscopy by tethering 
RNA molecules to a surface. 



Model 

Capsid subunits. We enable simulation of large capsids and long time scales 
by representing capsid protein subunits as rigid bodies with discrete positions on 
a lattice and continuous orientations. Subunits have pairwise interactions com- 
prised of excluded volume, represented by allowing a maximum of one subunit 
per lattice site, and attractions constructed such that the lowest energy states in 
the model are closed shells or 'capsids' with a preferred number of subunits iV c . 
As shown in Fig. [0 the variations of subunit orientations within a model capsid 
can be compared with those in an actual icosahedral capsid. For a closed shell 
with a preferred size to be the lowest energy state, subunit-subunit interactions 
must (1) lead to a preferred large-scale curvature and (2) drive the formation of 
two dimensional manifolds rather than bulk structures. As has been the case for 



off-lattice models of capsid assembly (I5LI49M52I). these requirements are satisfied 



in our lattice model through interactions that simultaneously depend on relative 
orientations and positions of subunits (Eqs. [2]and[T]). The model is designed to 
eliminate the influence of lattice structure on interaction free energies to the great- 
est extent possible. 

To explain the interaction potential, we consider two neighboring subunits i 
and j with respective lattice positions and Vj, and unit orientation vectors f2j 
and Clj which have a relative angle 9 = cos _1 (f2j • flj), as shown in Fig.Q] The 
subunits experience a favorable interaction with energy — e b when two conditions 
(Eqs. [Hand [2]) are satisfied. First, an attraction requires that the subunit orienta- 
tions are nearly consistent with the preferred circumference n c i rc of the capsid 

\9-27r/n cnc \<5 (1) 
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with 5 the orientational specificity parameter. For all simulations in this work 

5 = tt/30. 

The second requirement drives formation of a single layer shell. We define a 
bond vector d b for the i, j interaction, which is obtained by rotating flj by the 
angle (7r + 6 l )/2 around the rotation axis f2 a = (f2j x fi^/lOj x Q.,-|. An attractive 
interaction requires that the displacement vector = r,, — r-j is the neighbor 
displacement vector that most closely parallels the bond vector d b : 

Tj - Ti = (arg max[n • d b ]) (2) 

n 

with {n} the set of 26 neighbor lattice displacement vectors. 

As shown in Fig. Q}), the interaction potential drives subunits to assemble into 
flat single-layer sheets with orientations that gradually rotate, which enables rep- 
resentation of curved structures even on a cubic lattice. The requirement Eq. [2] 
drives the sheet to turn when subunit orientations reach a critical angle. Because 
of the finite angle tolerance 5, the turn is stochastic and model capsids are not 
always perfect cubes. Near this critical angle, a single subunit could satisfy Eqs.Q] 
and[2]with two different subunits each in a different lattice site with slightly differ- 
ent orientations. To avoid this possibility, subunits have 'exclusion zones' located 
on the two of the 26 neighbor sites closest to the forward and backward extensions 
of the orientation unit vector respectively (see SI Figs. S6, S7). A subunit position 
cannot overlap with an exclusion zone of another subunit, but multiple exclusion 
zones can share the same lattice site. The interaction geometry ensures that a sub- 
unit dimer will have the same interaction free energy for any lattice position and 
dimer orientation. 



Polymer. The polymer is represented with the bond fluctuation model (|53 . 



541) . modified so that polymer segments occupy only a single lattice site and have 
allowed bond lengths of 1 and \/2. Configurations in which polymer bonds cross 

are rejected; full details are given in the SI. The polymer radius of gyration scales 

3/5 

as N p as expected for good solvent conditions (Fig. S8). Polymer segments 
are endowed with a unit orientation vector, and experience interactions with en- 
ergy — e pc when (1) a capsid subunit is located one lattice site forward in the 
direction of the polymer orientation vector and (2) the negative of the subunit ori- 
entation vector points toward the polymer segment. This feature represents the 
fact that positive charges located on the inner surface of a capsid interact with 
encapsidated poly electrolytes. To enable polymer- subunit bonds, polymer seg- 
ments may occupy an exclusion zone defined by extending a subunit's orientation 
backward. In simulations with polymer-polymer attractions, polymer subunits 
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experience isotropic attractive interactions with magnitude e pp to other polymer 
subunits within the 26 closest neighbor sites. Otherwise, there are no interactions 
between polymer segments except for excluded volume. 

Simulations. The simulations use dynamic Monte Carlo (MC) moves in 
which a subunit or a polymer segment is displaced to a lattice site randomly cho- 
sen from the set of 26 nearest neighbor sites and the current position, and a new 
orientation vector is chosen randomly from the unit sphere. This procedure as- 
sumes that translation and orientation relaxation times are comparable. Moves 
are accepted or rejected according to the Metropolis criteria (|55l) . To efficiently 
represent a dilute solution of polymer in excess capsid subunits, we use periodic 
boundary conditions and coupled the system to a bath of subunits with concen- 
tration c by performing grand canonical MC moves in which subunits are in- 
serted or deleted (|43l) . To maintain realistic dynamics, insertions and deletions 
are performed only in the outermost lattice layer (defined relative to the mid- 
dle polymer segment) with a frequency consistent with the diffusion limited rate 
(|43l d6t) . To maintain computational feasibility with extremely long polymers, 
some simulations have a box side-length that is shorter than the full extension 
of the polymer. The side length L was chosen based the relationship between 
polymer length N p and the confinement free energy F con f of an un-encapsidated 
polymer F conf = N^ 4 L~ 15 / 4 (57), to maintain F conf < 4A; B T (which is insignifi- 
cant compared to total binding energies and entropies) and L > 23. There were 
no observed instances of multiple polymer images interacting with an assembling 
capsid. 

Potential model extensions. Experiments with cowpea chlorotic mottle virus 
dii [Toh show that the virus can assemble with different triangulation (T) num- 
bers around polymers of different sizes. This possibility could be examined with 
our model by incorporating subunit 'conformational changes' which modify the 
preferred circumference, following Ref. (56). An explicit representation of RNA 
base-pairing would enable predictions of the differences between packaging ho- 
mopolymers and viral RNA, and should include the dependence of molecular 
rigidity on base-pairing as well as recognition sequences. 



Supplementary Material 



Supplementary material is available online at 



http : //www.b randeis .edu/~kivenson/kivenso n20 1 Osi.pdf , 
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Figure 1: The capsid model, (a) The spike proteins of the hepatitis B virus (HBV) 
indicate the orientations of its subunits (image from Viper (59)). (b) A model 
capsid with iV c = 488 subunits (circumference n circ = 36 subunits). Subunits are 
drawn as cones to indicate their orientations and lines are drawn between interact- 
ing subunits. (c) A cross-section view of the model capsid. The spatial variation 
of subunit orientations in (b) and (c) can be compared to that of the spike proteins 
in HBV. (d) The interaction geometry is shown for two model capsid subunits, % 
and j. In this illustration, the orientation vectors fij and flj are in the plane of the 
figure and thus the rotation axis tt a is perpendicular to that plane. The orientation 
of the bond vector d b is determined by the angle 9 between the two orientation 
vectors as described in the text. A favorable interaction for this configuration 
requires that 9 satisfies Eq.Q]and that d b and satisfy Eq.[2l 
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Figure 2: Snapshots from a typical assembly trajectory; a small region of the 
simulation box is shown. Parameters are N p = 350, e pc = 5.75k B T, and N c = 
488. 
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Figure 3: The average number of favorable polymer-capsid contacts is shown as 
a function of partial-capsid intermediate size for assembly trajectories of capsids 
with size iV c = 728, for various indicated polymer lengths. The polymer-subunit 
affinity is e pc = 5. 75 k B T. 
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Figure 4: Examples of configurations that rarely lead to complete polymer in- 
corporation, (a) Capsid closure progressed faster than complete polymer incor- 
poration, trapping an exposed polymer tail, which blocks insertion of the final 
subunit. (b) Two partial-capsids nucleated on the same polymer and grew to 
nearly complete capsids. Parameters for both cases are N c = 488, N p = 400, 
and e pc = 5.75k B T. (c) Malformed polymer-capsid assemblies observed experi- 
mentally (figure adapted from Ref. (I36h ). 




Figure 5: Packaging efficiencies depend on polymer length, capsid size, polymer- 
subunit affinity, and time, (a) Packaging efficiencies are shown as a function of 
the ratio of polymer length to inner surface area, for capsids with indicated capsid 
sizes iV c and e pc = 5.75k B T. (b) Packaging efficiencies are shown as a function 
of polymer length N p and polymer- subunit affinity e pc for capsids with size iV c = 
296. Results are shown for tf — 2 x 10 s MC sweeps, (c) Packaging efficiencies are 
compared with polymer-polymer attractions (e pp = 0.075k B T, e pc = 5.25k B T) 
and without (e pp = 0.0k B T, e pc = 5.75k B T). The lowest-energy capsid size is 
N c = 386, but somewhat larger capsids can form around long polymers. 





Figure 6: Snapshots from a typical assembly trajectory with polymer-polymer 
attractions; a small region of the simulation box is shown. Parameters are N p = 
410, e pc = 5.25/c B r, £ pp = 0.075k B T, and lowest energy capsid size N c = 386. 
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Figure 7: (a) Average rates for the nucleation and growth phases of assembly are 
shown as functions of polymer length, with N c = 488 and e pc = 5.75/cbT. (b) 
Nucleation times decrease exponentially as subunit-polymer affinity increases, (c) 
Capsid growth rates depend on the subunit sliding rate. The median nucleation 
and growth times are shown for simulations with sliding moves (defined in the 
text) attempted with frequencies of 0, 1, and 10 (relative to attempt frequencies 
for ordinary subunit motions). Parameters are N p = 100, e pc = 5.75k Q T, and 
N c = 296. 



